Speaker Verification based on Single Channel Speech Separation

Abstract

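In multi-speaker scenarios, speech processing tasks such as speaker identification and speech recognition are susceptible to noise and overlapped voices. As the overlapped voices are a complicated mixture of signals, a method that extracts the target speech from this mixture is a good front-end solution for further processing such as understanding and classifying. The quality of speech separation can be assessed by a ratio-based metric or subjective scoring, and also by the accuracy of downstream tasks such as speaker identification. In order to make the separation model more adapted to complex overlapping scenarios, this research investigates how to incorporate speech separation with the voiceprint recognition task. This paper proposes a feature-scale single-channel speech separation network connected to a back-end speaker verification network with the MFCCT feature, so that the accuracy of speaker verification indicates the quality of the separation. The datasets are prepared by synthesizing VoxCeleb1 data and are used for training and testing. The results show that using an objective downstream evaluation can effectively improve overall performance, as the optimized separation model significantly reduced the error rate of speaker verification.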
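As a rough illustration of the two measurable pieces the abstract mentions, the sketch below overlaps two single-channel waveforms at a chosen power ratio and scores a verifier by its equal error rate. It is not the paper's pipeline: the abstract does not say how the VoxCeleb1 mixtures were built or which error rate was reported, so mixing at a fixed signal-to-noise ratio and using the equal error rate are both assumptions, and the function names `mix_at_snr` and `equal_error_rate` are hypothetical. Only NumPy is used.

```python
import numpy as np


def mix_at_snr(target, interferer, snr_db):
    """Overlap two mono waveforms so the target is snr_db above the interferer.

    Assumption: both inputs are non-silent 1-D arrays at the same sample rate.
    Returns the mixture plus the two aligned sources that were summed.
    """
    n = min(len(target), len(interferer))
    target = np.asarray(target[:n], dtype=np.float64)
    interferer = np.asarray(interferer[:n], dtype=np.float64)

    target_power = np.mean(target ** 2)
    interferer_power = np.mean(interferer ** 2)

    # Choose the gain so that 10*log10(target_power / scaled_power) == snr_db.
    gain = np.sqrt(target_power / (interferer_power * 10 ** (snr_db / 10)))
    scaled = gain * interferer
    return target + scaled, target, scaled


def equal_error_rate(genuine_scores, impostor_scores):
    """Approximate the EER: the point where false accepts equal false rejects.

    Higher scores are assumed to mean "same speaker".
    """
    genuine = np.asarray(genuine_scores, dtype=np.float64)
    impostor = np.asarray(impostor_scores, dtype=np.float64)
    thresholds = np.sort(np.concatenate([genuine, impostor]))

    # Sweep every observed score as a decision threshold.
    false_accept = np.array([np.mean(impostor >= t) for t in thresholds])
    false_reject = np.array([np.mean(genuine < t) for t in thresholds])

    best = np.argmin(np.abs(false_accept - false_reject))
    return (false_accept[best] + false_reject[best]) / 2


# Synthetic stand-ins for two utterances (random noise, not real speech).
rng = np.random.default_rng(0)
speaker_a = rng.standard_normal(16000)
speaker_b = 3.0 * rng.standard_normal(20000)

mixture, clean_a, scaled_b = mix_at_snr(speaker_a, speaker_b, snr_db=5.0)
achieved_snr = 10 * np.log10(np.mean(clean_a ** 2) / np.mean(scaled_b ** 2))

# Hypothetical verification scores, chosen only to exercise the function.
eer_separable = equal_error_rate([0.9, 0.8, 0.7], [0.1, 0.2, 0.3])
eer_chance = equal_error_rate([0.1, 0.5, 0.9], [0.1, 0.5, 0.9])
```

In a setup like the one described, the verifier would be scored once on the raw mixtures and once on the separated output, and a lower error rate on the separated output would indicate better separation.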

Similar resources

Speaker separation using visual speech features and single-channel audio

This work proposes a method of single-channel speaker separation that uses visual speech information to extract a target speaker’s speech from a mixture of speakers. The method requires a single audio input and visual features extracted from the mouth region of each speaker in the mixture. The visual information from speakers is used to create a visually-derived Wiener filter. The Wiener filter...

Catalog-based single-channel speech-music separation

We propose a new catalog-based speech-music separation method for background music removal. Assuming that we know a catalog of the background music, we develop a generative model for the superposed speech and music spectrograms. We represent the speech spectrogram by a Non-negative Matrix Factorization (NMF) model and the music spectrogram by a conditional Poisson Mixture Model (PMM). By choosi...

A Generalized Approach for Model-based Speaker-dependent Single Channel Speech Separation

In this paper, we present a new technique for separating two speech signals received from one microphone or one communication channel. In this special case, the separation problem is too ill-conditioned to be handled with common blind source separation techniques. The proposed technique is a generalized approach to model-based speaker-dependent single channel speech separation techniq...

Wavelet-based speaker change detection in single channel speech data

Speaker segmentation is the task of finding speaker turns in an audio stream. We propose a metric-based algorithm based on Discrete Wavelet Transform (DWT) features. Principal component analysis (PCA) or linear discriminant analysis (LDA) [1] are further used to reduce the dimensionality of the feature space and remove redundant information. In the experiments our methods referred to as DWT-PCA...

Single-speaker/multi-speaker co-channel speech classification

The demand for content-based management and real-time manipulation of audio data is constantly increasing. This paper presents a method to identify temporal regions, in a segment of co-channel speech, as being either single-speaker or multispeaker speech. The state of the art approach for this purpose is the kurtosis. In this paper, a set of complementary time-domain and frequency-domain featur...

Journal

Journal title: IEEE Access

Year: 2023

ISSN: 2169-3536

DOI: https://doi.org/10.1109/access.2023.3287868